NSF PAR Search | NSF Public Access Repository

DroEdgeEM: A Drone-Edge Collaborative Emulation Platform for Emerging Situation-aware Applications

https://doi.org/10.1145/3769102.3774248

Shrestha, Summit; Muliashia, Rit; Rao, Rohit; Sarma, Anirudh; Nussbaum, Alan; Lee, Myungjin; Ramachandran, Umakishore (December 2025, ACM)

Free, publicly-accessible full text available December 3, 2026
OPCM: Opportunistic Performance-driven Connectivity Management for 5G/xG Networks

https://doi.org/10.1145/3768970

Hassan, Ahmad; Ye, Wei; Zhang, Anlan; Fezeu, Rostand A K; Carpenter, Jason; Zhu, Ruiyang; Jin, Shuowei; Lee, Myungjin; Jajoo, Akshay; Mao, Morley; et al (November 2025, Proceedings of the ACM on Networking)

5G and future 6G networks deploy cells with diverse combinations of access technologies, architectures, and radio frequency bands/channels. Cellular operators also employ carrier aggregation for higher data access speeds. We investigate the fundamental question of how to intelligently and dynamically configure and reconfigure a user equipment's serving cells to deliver the best network performance. Through comprehensive measurements across 12 cities in 5 countries, we experimentally show the wide availability, heterogeneity, and untapped performance gains of today's cell deployments. We then present a principled, performance-driven connectivity management framework, dubbed OPCM. It is a centralized solution deployed at the base station, allowing it to coordinate multiple UEs, enforce operator policies, and facilitate user fairness. Extensive evaluations show that OPCM improves the application QoE by up to 65.2%.
Free, publicly-accessible full text available November 24, 2026
Client Availability in Federated Learning: It Matters!

https://doi.org/10.1145/3721146.3721964

Garg, Dhruv; Sanyal, Debopam; Lee, Myungjin; Tumanov, Alexey; Gavrilovska, Ada (March 2025, ACM)

Free, publicly-accessible full text available March 30, 2026
DepS: Delayed epsilon-Shrinking for Faster Once-for-All Training

https://doi.org/10.1007/978-3-031-73024-5_19

Annavajjala, Aditya; Khare, Alind; Agrawal, Animesh; Fedorov, Igor; Latapie, Hugo; Lee, Myungjin; Tumanov, Alexey (November 2024, Springer Nature Switzerland)

CNNs are increasingly deployed across different hardware, dynamic environments, and low-power embedded devices. This has led to the design and training of CNN architectures with the goal of maximizing accuracy subject to such variable deployment constraints. As the number of deployment scenarios grows, there is a need to find scalable solutions to design and train specialized CNNs. Once-for-all training has emerged as a scalable approach that jointly co-trains many models (subnets) at once with a constant training cost and finds specialized CNNs later. The scalability is achieved by training the full model and simultaneously reducing it to smaller subnets that share model weights (weight-shared shrinking). However, existing once-for-all training approaches incur huge training costs reaching 1200 GPU hours. We argue this is because they start the process of shrinking the full model either too early or too late. Hence, we propose Delayed Epsilon-Shrinking (DepS) that starts the process of shrinking the full model when it is partially trained, which leads to training cost improvement and better in-place knowledge distillation to smaller models. The proposed approach also consists of novel heuristics that dynamically adjust subnet learning rates incrementally, leading to improved weight-shared knowledge distillation from larger to smaller subnets as well. As a result, DepS outperforms state-of-the-art once-for-all training techniques across different datasets including CIFAR10/100, ImageNet-100, and ImageNet-1k on accuracy and cost. It achieves higher ImageNet-1k top-1 accuracy or the same accuracy with a 1.3x reduction in FLOPs and a 2.5x drop in training cost (GPU-hours).
Full Text Available
MetaFL: Privacy-preserving User Authentication in Virtual Reality with Federated Learning

https://doi.org/10.1145/3666025.3699322

Cheng, Ruizhi; Wu, Yuetong; Kundu, Ashish; Latapie, Hugo; Lee, Myungjin; Chen, Songqing; Han, Bo (November 2024, ACM)

Full Text Available
SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-device Inference

https://doi.org/10.1007/978-3-031-72986-7_10

Khare, Alind; Agrawal, Animesh; Annavajjala, Aditya; Behnam, Payman; Lee, Myungjin; Latapie, Hugo; Tumanov, Alexey (November 2024, Springer Nature Switzerland)

Neural Architecture Search (NAS) for Federated Learning (FL) is an emerging field. It automates the design and training of Deep Neural Networks (DNNs) when data cannot be centralized due to privacy, communication costs, or regulatory restrictions. Recent federated NAS methods not only reduce manual effort but also help achieve higher accuracy than traditional FL methods like FedAvg. Despite the success, existing federated NAS methods still fall short in satisfying diverse deployment targets common in on-device inference including hardware, latency budgets, or variable battery levels. Most federated NAS methods search for only a limited range of neuro-architectural patterns and repeat them in a DNN, thereby restricting achievable performance. Moreover, these methods incur prohibitive training costs to satisfy deployment targets. They perform the training and search of DNN architectures repeatedly for each case. SuperFedNAS addresses these challenges by decoupling the training and search in federated NAS. SuperFedNAS co-trains a large number of diverse DNN architectures contained inside one supernet in the FL setting. Post-training, clients perform NAS locally to find specialized DNNs by extracting different parts of the trained supernet with no additional training. SuperFedNAS takes O(1) (instead of O(N)) cost to find specialized DNN architectures in FL for any N deployment targets. As part of SuperFedNAS, we introduce MaxNet, a novel FL training algorithm that performs multi-objective federated optimization of DNN architectures (≈5×10^8) under different client data distributions. SuperFedNAS achieves up to 37.7% higher accuracy or up to 8.13x reduction in MACs than existing federated NAS methods.
Full Text Available
Boosting Collaborative Vehicular Perception on the Edge with Vehicle-to-Vehicle Communication

https://doi.org/10.1145/3666025.3699328

Zhu, Ruiyang; Zhu, Xiao; Zhang, Anlan; Zhang, Xumiao; Sun, Jiachen; Qian, Feng; Qiu, Hang; Mao, Z Morley; Lee, Myungjin (November 2024, ACM)

Full Text Available
On the Predictability of Fine-grained Cellular Network Throughput using Machine Learning Models

Basit, Omar; Dinh, Phuc; Khan, Imran; Kong, Z Jonny; Hu, Y Charlie; Koutsonikolas, Dimitrios; Lee, Myungjin; Liu, Chaoyue (September 2024, Proc. of IEEE MASS 2024)

Networking research has witnessed a renaissance from exploring the seemingly unlimited predictive power of machine learning (ML) models. One such promising direction is throughput prediction: accurately predicting the network bandwidth or achievable throughput of a client in real time using ML models can enable a wide variety of network applications to proactively adapt their behavior to the changing network dynamics to potentially achieve significantly improved QoE. Motivated by the key role of newer generations of cellular networks in supporting the new generation of latency-critical applications such as AR/MR, in this work, we focus on accurate throughput prediction in cellular networks at fine time-scales, e.g., on the order of 100 ms. Through a 4-day, 1000+ km driving trip, we collect a dataset of fine-grained throughput measurements under driving across all three major US operators. Using the collected dataset, we conduct the first feasibility study of predicting fine-grained application throughput in real-world cellular networks with mixed LTE/5G technologies. Our analysis shows that popular ML models previously claimed to predict well for various wireless network scenarios (e.g., WiFi or a single-technology network such as LTE only) do not predict well under app-centric metrics such as ARE95 and PARE10. Further, we uncover the root cause for the poor prediction accuracy of ML models as the inherent conflicting sample sequences in the fine-grained cellular network throughput data.
Full Text Available
On the Predictability of Fine-Grained Cellular Network Throughput Using Machine Learning Models

https://doi.org/10.1109/MASS62177.2024.00018

Basit, Omar; Dinh, Phuc; Khan, Imran; Kong, Z Jonny; Hu, Y Charlie; Koutsonikolas, Dimitrios; Lee, Myungjin; Liu, Chaoyue (September 2024, IEEE)

Networking research has witnessed a renaissance from exploring the seemingly unlimited predictive power of machine learning (ML) models. One such promising direction is throughput prediction: accurately predicting the network bandwidth or achievable throughput of a client in real time using ML models can enable a wide variety of network applications to proactively adapt their behavior to the changing network dynamics to potentially achieve significantly improved QoE. Motivated by the key role of newer generations of cellular networks in supporting the new generation of latency-critical applications such as AR/MR, in this work, we focus on accurate throughput prediction in cellular networks at fine time-scales, e.g., on the order of 100 ms. Through a 4-day, 1000+ km driving trip, we collect a dataset of fine-grained throughput measurements under driving across all three major US operators. Using the collected dataset, we conduct the first feasibility study of predicting fine-grained application throughput in real-world cellular networks with mixed LTE/5G technologies. Our analysis shows that popular ML models previously claimed to predict well for various wireless network scenarios (e.g., WiFi or a single-technology network such as LTE only) do not predict well under app-centric metrics such as ARE95 and PARE10. Further, we uncover the root cause for the poor prediction accuracy of ML models as the inherent conflicting sample sequences in the fine-grained cellular network throughput data.
Full Text Available
LIFL: A Lightweight, Event-driven Serverless Platform for Federated Learning

Qi, Shixiong; Ramakrishnan, K K; Lee, Myungjin (May 2024, Proceedings of Machine Learning and Systems 6 (MLSys 2024) Conference)

Federated Learning (FL) typically involves a large-scale, distributed system with individual user devices/servers training models locally and then aggregating their model updates on a trusted central server. Existing systems for FL often use an always-on server for model aggregation, which can be inefficient in terms of resource utilization. They also may be inelastic in their resource management. This is particularly exacerbated when aggregating model updates at scale in a highly dynamic environment with varying numbers of heterogeneous user devices/servers. We present LIFL, a lightweight and elastic serverless cloud platform with fine-grained resource management for efficient FL aggregation at scale. LIFL is enhanced by a streamlined, event-driven serverless design that eliminates the individual, heavyweight message broker and replaces inefficient container-based sidecars with lightweight eBPF-based proxies. We leverage shared memory processing to achieve high-performance communication for hierarchical aggregation, which is commonly adopted to speed up FL aggregation at scale. We further introduce the locality-aware placement in LIFL to maximize the benefits of shared memory processing. LIFL precisely scales and carefully reuses the resources for hierarchical aggregation to achieve the highest degree of parallelism, while minimizing aggregation time and resource consumption. Our preliminary experimental results show that LIFL achieves significant improvement in resource efficiency and aggregation speed for supporting FL at scale, compared to existing serverful and serverless FL systems.
Full Text Available
